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MUTATED CLASS H MAJOR HISTOCOMPATIBILITY PROTEINS 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application is a Continuation-in-Part of United States Provisional Application No. 
60/254,248, filed December 8, 2000. 

BACKGROUND OF THE INVENTION 
The field of the present invention is molecular biology, in particular, as it is related to 
combinatorial libraries of immune cell proteins displayed on the cell surface of a recombinant host 
cell. More specifically, the present invention relates to a library of major histocompatibility locus 
proteins displayed on the surfaces of recombinant yeast cells, to mutant MHC Class II and 
proteins selected for improved binding to particular target peptides, to mutant MHC proteins 
selected for binding to a particular antigen, to MHC Class II proteins of improved stability and 
to the use of the selected high affinity and/or more stable MHC derivatives in diagnostic methods 
and imaging assays, among other applications including prophylactic and therapeutic treatments. 

Proteins encoded by the major histocompatibility complex (called MHC proteins) are 
requisite components of the antigenic complexes that are involved in many diseases. These 
diseases include cases where the body reacts with one's own MHC proteins (in various 
autoimmune diseases) or infectious diseases and cancer, where the MHC are critical in binding 
and presenting foreign, antigenic peptides. In this invention, MHC proteins the class II type were 
expressed as heterologous, surface-linked fusions on yeast cells with the goal of generating 
improved MHC proteins. Libraries of mutant MHC and mutant peptide-MHC complexes can be 
screened for higher surface levels in order to identify variants that exhibited improved properties, 
including enhanced stability. For the first time, this system allows the directed evolution of MHC 
molecules that represent novel agents for various diagnostic and therapeutic applications. These 
agents could be used for treatment or prevention of cancer, infectious diseases (e.g., virus 
infections), and autoimmune diseases (e.g., multiple sclerosis, type I diabetes, rheumatoid 
arthritis). 



A number of strategies have been used or proposed to suppress autoimmune diseases, 
most notably drugs, such as cyclophosphamide, cyclosporin A, methotrexate, and Imuran 
(azathioprine). Steroid compounds, such as prednisone and methylprednisolone, are also 
employed in many instances. These drugs have limited long term efficacy against both cell- and 
antibody-mediated autoimmune diseases. Use of such drugs is limited by virtue of their toxic side 
effects that include "global" immunosuppression. Prolonged treatment with these drugs inhibits 
the normal protective immune response to pathogenic microorganisms, thereby increasing the risk 
of infections. A further drawback is that immune-mediated elimination of aberrant cells is 
impaired and there is, thus, an increased risk that malignancies will develop in patients receiving 
prolonged global immunosuppression. 

The self substances, or autoantigens, which are the targets of autoimmune responses are 
most often protein products unique to the targeted cells (e.g., hormones such as insulin dependent 
diabetes mellitus, IDDM); particular enzymes unique to the specialized function of targeted cells 
(e.g., glutamic acid decarboxylase or GAD in IDDM, or 21 hydroxylase in Addison's disease); 
specialized cell-specific receptor molecules (e.g. the thyroid stimulating hormone (TSH) receptor 
in Graves' disease or acetylcholine receptors in the neuromuscular junctions in myasthenia gravis); 
and/or structural constituents of the targeted cells or tissues (e.g., beta cell sialo-glycoconjugate 
in IDDM). Prior to the current invention, immunization with autoantigens has been used as a 
means to induce autoimmune disease in experimental animals. For example, the administration 
of myelin basic protein (MBP) has been used as a means to induce experimental allergic 
encephalomyelitis (EAE, a model for MS) in mice. Additional treatments and prophylaxes are 
needed. 

There is a long felt need in the art for Class II MHC proteins and Class II MHC/peptide 
complexes with improved stability and/or with improved T cell regulatory properties. Such 
improved Class II MHC proteins or complexes are useful in acting as antagonists of T cells that 
participate in the inappropriate removal of target cells or tissue. The improved Class II MHC 
proteins and complexes of the present invention are also improved for use as research tools in 
view of their improved stabilities. There is also an urgent need for prophylactic treatments to 
prevent serious autoimmune diseases such as Type I diabetes. 



SUMMARY OF THE INVENTION 
The invention provides a combinatorial library of class II MHC proteins displayed on the 
surfaces of recombinant host cells, for example, yeast cells, desirably, Saccharomyces cerevisiae. 
From such a library can be isolated mutant MHC proteins that exhibit greater affinity for a ligand 
or a ligand peptide than the wild type Class II protein and/or Class II MHC proteins and Class II 
MHC/peptide complexes that are improved in biochemical stability over the corresponding wild 
type proteins and complexes. 

Suitable labels allowing for use of a stable peptide-Class II MHC chimeric protein 
complex, especially a mutant Class II MHC protein or peptide-MHC complex having improved 
stability and/or improved binding, directly or indirectly, include but are not limited to fluorescent 
compounds, chemiluminescent compounds, radioisotopes, chromophores, and others. The labeled 
protein or complex of the present invention, where it specifically binds to a peptide of interest 
with high affinity and specificity, can be used in diagnostic tests to the particular type of 
autoimmune disease by virtue of the specific binding of the peptide-MHC Class II complex to a 
specific T cell receptor protein, and it can be used in the body in imaging tests to locate and/or 
estimate extent of autoimmune damage in progress, or it can be used as an antagonist or drug to 
eliminate T cells that cause autoimmune damage, potential or in progress. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides a ribbon diagram corresponding to the crystal structure of a class II 
major histocompatibility protein (the non-IDDM linked allele, IA d ) with highlighted amino acids 
of particular interest. The crystal structure of the non-IDDM linked murine allele, IA d , is shown, 
displaying the influenza HAi 26 . 13g peptide (hatched) (PDB code 2IAD; Scott et al., 1998). The 
IA d p-chain and a-chain are highlighted in black and open shapes, respectively. IA g7 shares the 
same a-chain as IA d , but differs by 17 residues in the p-chain including p56H and p57D shown 
in blue. P56H and P57S residues of IA g7 prevent a salt-bridge with a76R, perhaps contributing 
to its instability. 

Figure 2 provides diagrams for the genetic engineering of two single-chain Class II MHC 
constructs cloned into the yeast vector pCT302. The thrombin cleavage site allows the 



production of single chain, soluble Class II MHC protein complex (top diagram). In the 
alternative, the IA g7 construct allows for the insertion of a DNA fragment encoding a peptide 
ligand of the IA g7 molecule upstream of that region (bottom diagram). The GAD65 and insulin 
B9-23 peptides are important in the autoimmune destruction of insulin producing cells in the 
5 development of Type I Diabetes. The BDC2.5 (A) peptide is associated with IDDM as well. 

Figure 3 shows the results of flow cytometric analyses of various scIA g7 constructions. 
The recombinant yeast cells expressing the scIA g7 were prepared, washed and incubated anti-HA 
mAb 12CA5 (Boehringer Mannheim, Indianapolis, IN), anti-c-wyc mAb 9E10 (1:50 dilution of 
10 raw ascites fluid; Berkeley Antibody Co., Richmond, CA), or 10 ug/ml anti- IA g7 mAb 10.216 
purified from hybridoma supernatant. Cells were then incubated with FITC-labeled F(ab') 2 goat 
anti-mouse IgG (1:50) (Kirkegaard and Perry Labs, Inc., Gaithersburg, MD), and labeled cells 
p were analyzed by flow cytometry. As observed in the flow histograms, the Aga-2-IA g7 fusion in 
s7 single chain format (Aga-2-HA-P-chain-linker-a-chain-c-wyc) was not detectable on the yeast 
Is = surface, which is consistent with the instability of the class II product. The Aga-2-IA g7 fusion with 
yy 3 peptides(GAD65[78-96],B9-23[Insulin],BDC2.5 [alanine stabilized variant]) linked at the IA g7 
~ amino terminus a-chain exhibited low or undetectable levels of expression. 

H Figure 4 summarizes the production of mutagenic libraries in order to generate stabilized 

2g MHC Class II, IA 87 . The random mutagenesis strategy employed the use of error-prone PCR with 
Primers 1 and 4 as described hereinbelow, and yeast homologous recombination to generate 
mutagenic libraries. Alternatively, a directed mutagenesis strategy can be employed, where 
degenerate Primers 2 and 3 and flanking Primers 1 and 4 are used in PCR reactions also as 
described hereinbelow. Isolated clones from each mutagenic strategy were rescued and 
25 sequenced to verify mutagenesis. Approximately 4-7 nucleotide errors were incorporated per 
1000 base pairs in the random scIA g7 libraries. 

Figure 5 provides the result of sorting yeast homologous recombination mutagenic 
libraries (GAD65 [78-96] scIA g7 ). Six random mutagenic scIA g7 yeast libraries were constructed 
30 with 10 5 -10 6 independent transformants. Two scIA g7 {356(357 directed mutagenic libraries were 
constructed with 10 4 - 10 s total independent transformants. The mutagenic libraries were then 



4 



screened using a Cytomation MoFlo sorter (Cytomation, Fort Collins, CO) to isolate stabilized 
Aga-2-IA g7 fusions. 

Figure 6 shows the results of sorting libraries generated by random mutagenesis. 
5 GAD65(78-96), scIA g7 and B9-23 scIA g7 yeast libraries were incubated with 25 ul anti-IA 87 mAb 
10.216 (10 ug/ml), washed with buffer (PBS/0.5% BSA), and incubated with FITC-labeled 
F (ab') 2 goat anti-mouse IgG (1:50). After washing, samples were sorted in purification mode 
(coincident negative cells rejected) on a Cytomation MoFlo sorter with an event rate of - 50,000 
cells. A total of 2 x 10 7 cells were examined during the first sorting round, collecting ~ 1% of the 
1 0 population. Collected cells were re-grown at 3 0 ° C in selective glucose medium for ~ 1 8-20 h, and 
scIA g7 surface expression was induced at 20 ° C in selective galactose medium. After three more 
rounds of sorting with anti-IA g7 mAb 10.216, the sorted libraries were incubated with 25 ul anti-c- 
myc mAb 9E10 (1 :50), washed with buffer (PBS/0. 5% BSA), incubated with FITC-labeled F(ab') 2 

£T goat anti-mouse IgG, and sorted, collecting the top 0.25% of the population. The collected cells 

i is 

15 were plated on selective glucose medium to isolate individual clones. Clones were further 
w examined using flow cytometry by staining with anti-IA 87 mAb 10.216 (Fig. 6B) and mti-c-myc 
I mAb 9E10 (Fig. 6 A) followed by FITC-labeled F(ab') 2 goat anti-mouse IgG. Plasmids from 

Q sorted scIA g7 yeast cells were rescued with a Zymoprep Miniprep kit (Zymo Research, Orange, 
CA). Rescued plasmid DNA was then transformed into E. coli DH10B competent cells by 
§1 electroporation. Transformants were plated on LB plates supplemented with 100 ug/ml 
ampicillin. Sequencing was performed using scIA g7 flanking primers splice 4/L and T7 promoter, 
and a scl A 87 p-chain specific primer, scIA g7 a/pXNK (5 '-CC A GGA C AG AGG CCC TC A AC-3 ', 
SEQ ID NO: 1), using fluorescence automated sequencing. Mutations in Mut 8 include GB 13 A, 
Sp57L, Wa43S and Val39D. 

25 

Figures 7A-7B show the results of sorting exemplary GAD65 and B9-23scIA g7 error- 
prone library or yeast cells expressing the wild type B9-23scIA g7 cell surface proteins with either 
anti-c-wyc or anti-IA g7 antibodies. Residues differing from the wild type MHC Class II protein 
are shown at the bottom of the figure. 

30 
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Figure 8 shows sequences of clones isolated by sorting from GAD65(78-96) scIA g7 and 
B9-23 scIA g7 error-prone PCR libraries. The scIA g7 wild type amino acid sequence and residue 
numbers are shown with corresponding residue mutations of GAD65(78-96) scIA g7 and B9-23 
scIA 87 error-prone clones. Multiple independent mutations were observed in both the scIA g7 P- 
chain and scIA g7 a-chain. This suggests that at least one of each of these mutations is linked to 
the increased stability of GAD65 (78-96) scIA g7 and B9-23 scIA g7 mutants. 

Figures 9A-9C show the results of successive sorts of the GAD65 scIA g7 p5657 library 
generated by directed mutagenesis. The GAD65 scIA g7 p5657 library was incubated with 25 ul 
anti-IA g7 mAb 10.216 (10 ug/ml), washed with buffer (PBS/0.5% BSA), and incubated with 
FITC-labeled F(ab') 2 goat anti-mouse IgG (1:50). After washing, samples were sorted in 
purification mode using a Cytomation MoFlo sorter. A total of 2 x 10 7 cells were examined 
during the first sorting round, collecting ~ 0.25% of the population. Collected cells were re- 
grown at 30° C in selective glucose medium for -18-20 h and scIA g7 surface expression was 
induced at 20° C in selective galactose medium. Following the second sort with anti-IA g7 mAb 
10.216, the sorted library was incubated with 25 ul anti-c-myc mAb 9E10 (1:50), washed with 
buffer (PBS/0.5% BSA), incubated with FITC-labeled F(ab') 2 goat anti-mouse IgG, and sorted, 
again collecting the top 0.25% of the population. Sorted clones were further analyzed by flow 
cytometry. 

Figures 10A-10B show the results obtained with an exemplary clone isolated by sorting 
the GAD65 scIA g7 p5657 library. Results are shown for sorts with an anti-c-myc antibody and 
with anti-IA E? antibody. 

Figure 1 1 shows the results of rapid (one day) sequential sorting of the randomly mutated 
BDC2.5 sc IA 87 P5657 library. The BDC2.5 scIA g7 (35657 yeast library was stained with 12.5 ul 
anti-IA g7 and biotin-labeled anti-c-/wyc antibody, incubated with FITC-labeled F(ab') 2 goat anti- 
mouse, y 2b chain specific, IgG 2b and streptavidin-phycoerythrin (SA:PE) conjugate. After 
washing, samples were sorted in purification mode (coincident negative cells rejected) using a 
fluorescence activated cell sorter. About 1% of the total cells examined in the first sort were 
collected. The collected cells were sequentially sorted twice more on the same day, collecting the 



top 1% of the population each time. The cells collected from the third sort were plated and then 
further examined by flow cytometry. 



Figure 12 provides a summary of clones isolated by fluorescence activated cell sorting 
5 from a BDC2.5 scIA g7 p5657 mutant library. Binding levels are shown as a % positive population 
shift to anti-c-myc mAb and anti- IA g7 mAb from BDC2.5 scIA g7 p5657 clones isolated from the 
final sequential sort. Nine mutants in addition to BDC2.5 scIA 87 wild type yeast, B9-23 scIA 87 
Mut8/yeast (anti- IA g7 mAb positive control), 7Msc4F 1 0/yeast (anti-c-wyc mAb positive control), 
and EBY100 (negative control) yeast were induced in galactose medium overnight at 30°C. Cells 
10 were analyzed by flow cytometry after staining with anti-c-wyc mAb (black, stippled bars), or 
stained with anti- IA g7 mAb (cross-hatched bars) followed by FITC-labeled F(ab') 2 goat anti- 
mouse IgG. Mutants isolated yielded higher surface level binding to anti-c-myc and anti- IA g7 
O antibodies than its BDC2.5 scIA g7 wild type counterpart. BDC2.5 scIA g7 J35657 mutants were 
M= sequenced and contained the consensus motifs of E/G 56 and L/M 57 . 
P 

jj? Figure 1 3 shows the binding peptide B- 1 040-63 to IA g7 transacted L cells. See Example 

s 2 for experimental details. 

Figure 14A provides a diagram of a scIA g7 p^ fusion, and Figure 14B provides a diagram 
ip of a peptide scIA g7 p^ fusion. 

DETAILED DESCRIPTION OF THE INVENTION 
The role of proteins encoded by the major histocompatibility complex (called MHC 
proteins) has now been known for over twenty years. MHC proteins are expressed by every 

25 individual and function as "antigen-presenting" molecules. That is, each MHC protein can bind 
to a variety of different small peptides (8 to 20 amino acids in length) that are derived from 
proteins present inside a cell. MHC proteins present both self-peptides (i.e., derived from an 
individual's own endogenous proteins) and foreign peptides (i.e., derived from a foreign agent 
such as a virus). Once a peptide is bound to the MHC protein, the entire peptide-MHC complex 

30 (pMHC) is expressed on the surface of the cell. If the peptide is foreign, a T lymphocyte (T cell) 
can potentially recognize the complex, and the resultant interaction of the T cell receptor (TCR) 
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and the pMHC can result in T cell activation. T cell activation can lead to recruitment of other 
immune cells and a corresponding inflammatory reaction. Such inflammatory reactions are 
beneficial if the pMHC target antigen is in fact derived from an infectious agent or from a 
neoplastic cell (i.e., cancer). However, such inflammatory reactions can be very detrimental if the 
pMHC target antigen is derived from self tissue, as the reactions can lead to severe autoimmune 
disease, where an individual's immune system attacks normal tissue. Such is the case when a 
patient's lymphocytes attack the islet cells of the pancreas (type I diabetes), the nervous system 
(multiple sclerosis), or joint-derived components (rheumatoid arthritis). 

The central role of pMHC complexes in these phenomena has been established by 
thousands of published studies that include genetic linkages of diseases to the human MHC locus 
(HLA). It has now also been established that it is possible to use appropriately characterized 
peptide-MHC molecules as either agonists of an immune response (e.g., in cancer and infectious 
diseases) or as antagonists of responses (e.g., in autoimmune responses). While several 
approaches have been taken to produce such pMHC complexes in soluble forms for these 
purposes and for biochemical/structural studies, it has not been possible to use current methods 
of in vitro directed evolution to improve the stability or antigenicity of the pMHC complex. This 
is because the pMHC complex is normally a membrane-associated complex composed of multiple 
different subunits (heavy chain, beta-2-microglobulin, and peptide in the case of a class I MHC, 
and cc-chain, j3-chain, and peptide in the case of class II MHC) and such proteins are typically not 
amenable to the current methods of directed evolution (primarily phage display). The present 
invention shows that a display system for directed evolution can be used to express properly 
folded class I and class II MHC proteins on the surface of yeast. The displayed peptide-MHC 
complexes can be used to directly activate T cells, for treatment or in order to identify/screen for 
pMHC antigens. In addition, mutated libraries of the pMHC proteins can be created and used for 
selection by flow sorting of stabilized pMHC variants. The stabilized variants could be identified 
because they were expressed at higher levels on the yeast surface and could therefore be easily 
identified by using a fluorescent-labeled probe for the pMHC construct, combined with high- 
throughput flow cytometric sorting of such cells. 
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The MHC Class II proteins have been associated with susceptibility and resistance to 
autoimmune disorders. Insulin dependent diabetes mellitus (IDDM) has been linked to certain 
murine I-A alleles and human HLA-DQ homologues. In all cases where an MHC Class II allele 
is associated with IDDM, there is a lack of D57 in the P chain of the MHC protein, whereas the 
5 D57 residue is present in the non-IDDM-linked alleles. The T H 1 inflammatory response which 
results in destruction of islet cells in IDDM is believed to result from poor central tolerance or 
promiscuous peptide binding of disease-linked MHC proteins. 

The non-obese (NOD) mouse is the generally accepted model for the study of IDDM. 
10 NOD mice spontaneously develop IDDM early in life due to the disease-associated MHC II 
haplotype I-A g7 . I-A g7 shares the same a chain as I-A d , but it differs by 17 residues in the P chain, 
including the P56H and P57S, which confer the I-A g7 diabetogenic character. Replacement of 
□ the I-A g7 p56H and P57S residues drastically reduces the incidence of diabetes in NOD mice. 
y[ Populations of I-A g7 have been shown to be susceptible to sodium dodecyl sulfate denaturation, 
W and they have relatively weak peptide binding, perhaps allowing the T cells to escape negative 
Ly selection (purging of self-reactive clones) in the thymus. 

Jf; With the goal of isolating mutant forms of Class II MHC proteins and protein/peptide 

i y 

M= complexes, a single chain fusion protein has been expressed through the use of genetic engineering 
2g technology. See, e.g., WO 99/36569, incorporated by reference herein, for a discussion of yeast 
surface display technology and vectors. See also United States Provisional Application No. 
60/254,495 filed December 8, 2000, also incorporated by reference herein. As specifically 
exemplified herein, the Class II protein is expressed as a single chain protein of the format AGA2- 
HA-P chain-linker-a chain-c-/wyc. The wild type fusion protein is not detected on the yeast cell 
25 surface, thus reflecting the instability of the wild type Class II MHC protein. 

Class II I- A g7 has been cloned as an AGA2 fusion with 3 peptides (GAD65, insulin B9-23 
and BDC 2.5(alanine stabilized variant of the BDC2.5 peptide mimic, GKKVAAPVWIRMG, 
SEQ ID NO:21) linked at the amino terminus of the fusion protein. Low or undetectable 
30 expression levels result. 
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To create stabilized I-A g7 variants, eight different mutational libraries were produced by 
error prone polymerase chain reaction (PCR) to produce random mutations, and oligonucleotide 
site directed mutagenesis of residues 56 and 57 was carried out using homologous recombination 
after co-electroporation of the mutated coding sequence-containing nucleic acid molecules and 
linearized vector (pCT302 or pYDl, available from Invitrogen, Carlsbad, CA). Sorting of the 
randomly mutated GAD65 and B9-23 libraries with anti-c-wyc and anti-I-A g7 antibodies yielded 
many mutants with higher surface levels of the fusion protein, indicating increased stability of the 
molecule. Sorting of the BCD2.5 J356/057 library with anti-c-myc and anti-I-A 87 antibodies also 
yielded many mutants with higher surface levels of the fusion protein, indicating increased stability 
of the molecule. These mutants showed consensus motifs of E/G 56 and L/M 57 . 

The crystal structure of non-IDDM linked allele, I-A d is shown in Fig. 1. Insulin- 
dependent diabetes mellitus (IDDM) is associated with certain murine I-A alleles, such as IA g7 , 
and human HLA-DQ homologues. The crystal structure of a non-IDDM linked murine allele, IA d , 
is shown, displaying the influenza HA 126 . 138 peptide (PDB code 2IAD; Scott et al., 1998). The IA d 
0-chain and a-chain are also shown. IA 8? shares the same a-chain as IA d , but differs by 17 
residues in the p-chain, including p56H and P57D. p56H and p57S residues of IA g7 prevent salt- 
bridge formation with ct76R, perhaps contributing to its instability. 

The present invention allows the creation and isolation of stabilized variants of Class II 
peptide-MHC complexes. Toward this end, we have displayed single-chain peptide/Class II MHC 
complexes on the surface of yeast cells, and we have isolated stabilized variants of the I-A g7 
molecule in association with each of three peptides of interest. 

WO 99/36569 describes the yeast display technology in general terms. The MHC protein 
of interest is displayed on the yeast cell surface via a disulfide linkage through the AGA2 portion 
of the fusion protein comprising the MHC component. AGA2 is a mating adhesion receptor 
which is naturally bound to the cell surface in disulfide linkage to the AGA1 protein. The HA and 
the c-myc portions of the displayed fusion protein serve as epitope tags and can be used in 
normalizing the fluorescent peptide binding data. Each recombinant yeast cell displays about 
50,000 copies of the surface bound fusion protein (if stable) on its surface. A fluorescent antibody 
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or peptide ligand specific for the MHC protein of interest is added, and the cells are sorted using 
flow cytometry. Those MHC fusion proteins of increased stability exhibit stronger binding of the 
fluorescent ligand, and these cells are selected during the cell sorting procedure. 

Fig. 3 illustrates diagrammatically the pCT302 yeast surface display vector that contains 
a sequence encoding AGA2/HA-Class II MHC - c-myc fusion protein. This fusion protein coding 
sequence is expressed in yeast under the regulatory control of the inducible GAL1-10 promoter. 

The yeast display system was exploited to produce a random mutagenized library from 
which stabilized mutant Class II MHC sequences were isolated. Constructs encoding the fusion 
proteins were mutagenized randomly using error prone PCR (0.16 Mn:Mg molar ratio). A 
homologous recombination scheme was employed to create the libraries. 

In order to provide a clear and consistent understanding of the specification and claims, 
including the scope to be given to such terms, the following definitions are provided. 

A coding sequence is the part of a gene or cDNA which codes for the amino acid sequence 
of a protein, or for a functional RNA such as a tRNA or rRNA. 

Complement or complementary sequence means a sequence of nucleotides which forms 
a hydrogen-bonded duplex with another sequence of nucleotides according to Watson-Crick base- 
pairing rules. For example, the complementary base sequence for 5'-AAGGCT-3' is 3-TTCCGA- 
5'. 

Downstream means on the 3' side of any site in DNA or RNA. 

Expression refers to the transcription of a gene into structural RNA (rRNA tRNA) or 
messenger RNA (mRNA) and subsequent translation of a mRNA into a protein. 

An amino acid sequence that is functionally equivalent to a specifically exemplified class II 
MHC protein sequence is an amino acid sequence that has been modified by single or multiple 
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amino acid substitutions, by addition and/or deletion of amino acids, or where one or more amino 
acids have been chemically modified, but which nevertheless retains the binding specificity and 
high affinity binding activity of a cell-bound or a soluble MHC protein of the present invention. 
Functionally equivalent nucleotide sequences are those that encode polypeptides having 
substantially the same biological activity as a specifically exemplified cell-bound or soluble MHC 
protein. In the context of the present invention, a soluble MHC protein lacks the portions of a 
native cell-bound MHC and is stable in solution (i.e., it does not generally aggregate in solution 
when handled as described herein and under standard conditions for protein solutions). 

Two nucleic acid sequences are heterologous to one another if the sequences are derived 
from separate organisms, whether or not such organisms are of different species, as long as the 
sequences do not naturally occur together in the same arrangement in the same organism. 

Homology refers to the extent of identity between two nucleotide or amino acid 
sequences. 

Isolated means altered by the hand of man from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its original 
environment, or both. For example, a polynucleotide or a polypeptide naturally present in a living 
animal is not isolated, but the same polynucleotide or polypeptide separated from the coexisting 
materials of its natural state is isolated, as the term is employed herein. 

A linker region is an amino acid sequence that operably links two functional or structural 
domains of a protein. 

A nucleic acid construct is a nucleic acid molecule which is isolated from a naturally 
occurring gene or which has been modified to contain segments of nucleic acid which are 
combined and juxtaposed in a manner which would not otherwise exist in nature. 

Nucleic acid molecule means a single- or double-stranded linear polynucleotide containing 
either deoxyribonucleotides or ribonucleotides that are linked by 3'-5'-phosphodiester bonds. 
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Two DNA sequences are operably linked if the nature of the linkage does not interfere 
with the ability of the sequences to effect their normal functions relative to each other. For 
instance, a promoter region would be operably linked to a coding sequence if the promoter were 
capable of effecting transcription of that coding sequence. 

A polypeptide is a linear polymer of amino acids that are linked by peptide bonds. 

Promoter means a cis-acting DNA sequence, generally 80-120 base pairs long and located 
upstream of the initiation site of a gene, to which RNA polymerase binds and initiates correct 
transcription. There can be associated additional transcription regulatory sequences which 
provide on/off regulation of transcription and/or which enhance (increase) expression of the 
downstream coding sequence. 

A recombinant nucleic acid molecule, for instance a recombinant DNA molecule, is a novel 
nucleic acid sequence formed in vitro through the ligation of two or more nonhomologous DNA 
molecules (for example a recombinant plasmid containing one or more inserts of foreign DNA 
cloned into at least one cloning site. Alternatively, a recombinant DNA molecule can result from 
homologous recombination after co-transformation (or co-electroporation) of two DNA 
molecules sharing at least limited sequence identity. 

Transformation means the directed modification of the genome of a cell by the external 
application of purified recombinant DNA from another cell of different genotype, leading to its 
uptake and possibly its integration into the subject cell's genome. In bacteria, the recombinant 
DNA is not typically integrated into the bacterial chromosome, but instead replicates 
autonomously as a plasmid. 

Upstream means on the 5' side of any site in DNA or RNA. 

A vector is a nucleic acid molecule that is able to replicate autonomously in a host cell and 
can accept foreign DNA. A vector carries at least one origin of replication functional in at least 
one type of cell, one or more unique recognition sites for restriction endonucleases which can be 
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used for the insertion of foreign DNA, and usually selectable markers such as genes coding for 
antibiotic resistance, and often recognition sequences (e.g. promoter) for the expression of the 
inserted DNA. Common vectors include plasmid vectors and phage vectors. There can be more 
than one origin of replication to allow for replication and maintenance in more than one type of 
cell (e.g., separate origins of replication functional in yeast and Escherichia coli, respectively). 

Autoimmune destruction of tissue, in the present context, means that the immune system 
of an individual or animal has inappropriately targeted that tissue for killing. One important 
example is the autoimmune destruction of the insulin producing islet cells of the pancreas, which 
results in insulin dependent diabetes mellitus (Type I diabetes). Another example is the 
destruction of myelin surrounding nerve fibers in multiple sclerosis. In the case of Type I 
diabetes, two peptide antigens have been identified as important in the autoimmune response. 
These include the "B9-23 H peptide of insulin (encompassing amino acids 9-23 of the B chain of 
human insulin) and a peptide derived from the 65 kDa glutamate decarboxylase protein (GAD65; 
amino acids 78-96). See Table 2 for the amino acid sequences of these peptides. 

The role of proteins encoded by the major histocompatibility complex (MHC proteins) 
have been known for over twenty years. MHC proteins are expressed by every individual, and 
they function as antigen-presenting molecules. Each MHC protein can bind to a variety of 
different small peptides (of 8 to 20 amino acids). T cells recognize a foreign peptide bound to the 
MHC product through the ocP heterodimeric T cell receptor (TCR). The TCR repertoire has 
extensive diversity created by the same gene rearrangement mechanisms used in antibody heavy 
and light chain genes [Tonegawa, S. (1988) Biosci. Rep. 8:3-26]. Most of the diversity is 
generated at the junctions of variable (V) and joining (J) (or diversity, D) regions that encode the 
complementarity determining region 3 (CDR3) of the a and (3 chains [Davis and Bjorkman (1 988) 
Nature 334:395-402]. However, TCRs do not undergo somatic point mutations as do antibodies 
and, perhaps not coincidentally. The serial-triggering [Valitutti et al. (1995)Mtfwn?375:148-151] 
and kinetic proofreading [Rabinowitz et al. (1996) Proc. Natl. Acad. Sci. USA 93:1401-1405] 
models of T cell activation both suggest that longer off-rates (associated with higher affinity) are 
detrimental to the signaling process. It is also possible that higher affinity TCRs do not maintain 
the peptide specificity required for T cell responses. For example, peptides bound within the 
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MHC groove display limited accessible surface [Bjorkman, P.J. (1997) Cell 89: 167-170], which 
may in turn limit the amount of energy that can be generated in the interaction. On the other 
hand, raising the affinity of a TCR by directing the energy toward the MHC helices leads to 
thymic depletion during negative selection [Bevan, M.J. (1997) Immunity 7:175-178]. 

In summary, we have shown that Class II MHC proteins and protein-peptide complexes 
can be engineered to yield proteins and complexes of increased biochemical stability. The 
stabilized Class II MHC derivatives are useful in diagnosis or study of certain autoimmune 
diseases, and they are useful as antagonists of T cell-mediated autoimmune destruction of target 
tissues, for example, the destruction of insulin producing islet cells of the pancreas in the 
development of insulin dependent diabetes mellitus. 

It will be appreciated by those of skill in the art that, due to the degeneracy of the genetic 
code, numerous functionally equivalent nucleotide sequences encode the same amino acid 
sequence. 

Additionally, those of skill in the art, through standard mutagenesis techniques, in 
conjunction with the antigen-finding activity assays described herein, can obtain altered class II 
MHC sequences and test them for the expression of polypeptides having particular binding 
activity or improved biochemical stability. Useful mutagenesis techniques known in the art 
include, without limitation, oligonucleotide-directed mutagenesis, region-specific mutagenesis, 
linker-scanning mutagenesis, and site-directed mutagenesis by PCR [see e.g. Sambrook et al. 
(1989) and Ausubel et al. (1999)]. 

In obtaining variant MHC Class II coding sequences, those of ordinary skill in the art will 
recognize that MHC-derived proteins can be modified by certain amino acid substitutions, 
additions, deletions, and post-translational modifications, without loss or reduction of biological 
activity. In particular, it is well-known that conservative amino acid substitutions, that is, 
substitution of one amino acid for another amino acid of similar size, charge, polarity and 
conformation, are unlikely to significantly alter protein function. The 20 standard amino acids that 
are the constituents of proteins can be broadly categorized into four groups of conservative amino 
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acids as follows: the nonpolar (hydrophobic) group includes alanine, isoleucine, leucine, 
methionine, phenylalanine, proline, tryptophan and valine; the polar (uncharged, neutral) group 
includes asparagine, cysteine, glutamine, glycine, serine, threonine and tyrosine; the positively 
charged (basic) group contains arginine, histidine and lysine; and the negatively charged (acidic) 
group contains aspartic acid and glutamic acid. Substitution in a protein of one amino acid for 
another within the same group is unlikely to have an adverse effect on the biological activity of 
the protein. 

Homology between nucleotide sequences can be determined by DNA hybridization 
analysis, wherein the stability of the double-stranded DNA hybrid is dependent on the extent of 
base pairing that occurs. Conditions of high temperature and/or low salt content reduce the 
stability of the hybrid, and can be varied to prevent annealing of sequences having less than a 
selected degree of homology. For instance, for sequences with about 55% G - C content, 
hybridization and wash conditions of 40 - 50 °C, 6 X SSC (sodium chloride/sodium citrate buffer) 
and 0.1% SDS (sodium dodecyl sulfate) indicate about 60 - 70% homology, hybridization and 
wash conditions of 50 - 65° C, 1 X SSC and 0.1% SDS indicate about 82 - 97% homology, and 
hybridization and wash conditions of 52°C, 0. 1 X SSC and 0. 1% SDS indicate about 99 - 100% 
homology. A wide range of computer programs for comparing nucleotide and amino acid 
sequences (and measuring the degree of homology) are also available, and a list providing sources 
of both commercially available and free software is found in Ausubel et al. (1999). Readily 
available sequence comparison and multiple sequence alignment algorithms are, respectively, the 
Basic Local Alignment Search Tool (BLAST) [Altschul et al., (1997) Nucl. Acids Res. 25:3389- 
3402] and ClustalW programs. BLAST is available on the Internet at 
http://www.ncbi.nlm.nih.gov and a version of ClustalW is available at http://www2.ebi.ac.uk. 

Industrial strains of microorganisms (e.g., Aspergillus niger, Aspergillus ficuum, 
Aspergillus awamori, Aspergillus oryzae, Trichoderma reesei, Mucor miehei, Kluyveromyces 
lactis, Pichiapastoris, Saccharomyces cerevisiae, Escherichia coli, Bacillus sub ti lis or Bacillus 
licheniformis) or plant species (e.g., canola, soybean, corn, potato, barley, rye, wheat) may be 
used as host cells for the recombinant production of the stabilized mutant Class II MHC proteins 
of the present invention. As the first step in the heterologous expression of a high affinity MHC 
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protein or soluble protein, an expression construct is assembled to include the MHC or soluble 
MHC coding sequence and control sequences such as promoters, enhancers and terminators. 
Other sequences such as signal sequences and selectable markers may also be included. To 
achieve extracellular expression of a soluble MHC polypeptide, the expression construct may 
include a secretory signal sequence. The signal sequence is not included on the expression 
construct if cytoplasmic expression is desired. The promoter and signal sequence are functional 
in the host cell and provide for expression and secretion of the MHC or soluble MHC protein. 
Transcriptional terminators are included to ensure efficient transcription. Ancillary sequences 
enhancing expression or protein purification may also be included in the expression construct. 

Various promoters (transcriptional initiation regulatory region) may be used according to 
the invention. The selection of the appropriate promoter is dependent upon the proposed 
expression host. Promoters from heterologous sources may be used as long as they are functional 
in the chosen host. 

Promoter selection is also dependent upon the desired efficiency and level of peptide or 
protein production. Inducible promoters such tac are often employed in order to dramatically 
increase the level of protein expression E. coli. Overexpression of proteins may be harmful to the 
host cells. Consequently, host cell growth may be limited. The use of inducible promoter systems 
allows the host cells to be cultivated to acceptable densities prior to induction of gene expression, 
thereby facilitating higher product yields. 

Various signal sequences may be used according to the invention. A signal sequence 
which is homologous to the TCR coding sequence may be used. Alternatively, a signal sequence 
which has been selected or designed for efficient secretion and processing in the expression host 
may also be used. For example, suitable signal sequence/host cell pairs include the B. subtilis 
sacB signal sequence for secretion in B. subtilis, and the Saccharomyces cerevisiae a-mating 
factor or P. pastoris acid phosphatase/?/?o7 signal sequences for P. pastoris secretion. The signal 
sequence may be joined directly through the sequence encoding the signal peptidase cleavage site 
to the protein coding sequence, or through a short nucleotide bridge consisting of usually fewer 
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than ten codons, where the bridge ensures correct reading frame of the downstream TCR 
sequence. 



Elements for enhancing transcription and translation have been identified for eukaryotic 
5 protein expression systems. For example, positioning the cauliflower mosaic virus (CaMV) 
promoter 1 000 bp on either side of a heterologous promoter may elevate transcriptional levels by 
10- to 400-fold in plant cells. The expression construct should also include the appropriate 
translational initiation sequences. Modification of the expression construct to include a Kozak 
consensus sequence for proper translational initiation may increase the level of translation by 10 
10 fold. 



A selective marker is often employed, which may be part of the expression construct or 
p separate from it (e.g. , carried by the expression vector), so that the marker may integrate at a site 
g different from the gene of interest. Examples include markers that confer resistance to antibiotics 
^ ( e - g -' bla confers resistance to ampicillin for E. coli host cells, nptll confers kanamycin resistance 
yj to a wide variety of prokaryotic and eukaryotic cells) or that permit the host to grow on minimal 

medium (e.g., HIS4 enables P. pastoris or His" S. cerevisiae to grow in the absence of histidine). 
H The selectable marker has its own transcriptional and translational initiation and termination 

regulatory regions to allow for independent expression of the marker. If antibiotic resistance is 
2g employed as a marker, the concentration of the antibiotic for selection will vary depending upon 
H- the antibiotic, generally ranging from 10 to 600 ug of the antibiotic/mL of medium. 

The expression construct is assembled by employing known recombinant DNA techniques 
(Sambrook et al., 1989; Ausubel et aL, 1999). Restriction enzyme digestion and ligation are the 

25 basic steps employed to join two fragments of DNA. The ends of the DNA fragment may require 
modification prior to ligation, and this may be accomplished by filling in overhangs, deleting 
terminal portions of the fragment(s) with nucleases (e.g., ExoUl), site directed mutagenesis, or 
by adding new base pairs by PCR. Polylinkers and adaptors may be employed to facilitate joining 
of selected fragments. The expression construct is typically assembled in stages employing rounds 

30 of restriction, ligation, and transformation of E. coli. Numerous cloning vectors suitable for 
construction of the expression construct are known in the art (A.ZAP and pBLUESCRTPT SK-1, 
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Stratagene, La Jolla, CA; pET, Novagen Inc., Madison, WI; cited in Ausubel et al., 1999) and 
the particular choice is not critical to the invention. The selection of cloning vector will be 
influenced by the gene transfer system selected for introduction of the expression construct into 
the host cell. At the end of each stage, the resulting construct may be analyzed by restriction, 
DNA sequence, hybridization and PCR analyses. 

The expression construct may be transformed into the host as the cloning vector construct, 
either linear or circular, or may be removed from the cloning vector and used as is or introduced 
onto a delivery vector. The delivery vector facilitates the introduction and maintenance of the 
expression construct in the selected host cell type. The expression construct is introduced into 
the host cells by any of a number of known gene transfer systems (e.g., natural competence, 
chemically mediated transformation, protoplast transformation, electroporation, biolistic 
transformation, transfection, or conjugation) (Ausubel et al., 1999; Sambrook et al., 1989). The 
gene transfer system selected depends upon the host cells and vector systems used. 

For instance, the expression construct can be introduced into S. cerevisiae cells by 
protoplast transformation or electroporation. Electroporation of S. cerevisiae is readily 
accomplished, and yields transformation efficiencies comparable to spheroplast transformation. 
Co-electroporation of a linearized vector and a linear DNA molecule of interest having regions 
of homology to the vector at each end results in homologous recombination within the yeast cell, 
thus circumventing the need for ligation in vitro prior to transformation of the yeast cells. 

Now that stabilized IA g7 mutants have been isolated by the process of directed evolution 
and yeast display technology as described herein, those IA g7 mutants that are capable of binding 
peptides are identified. To perform these studies we required a labeled-IA g7 -binding peptide that 
could be used as a probe for further selection of IA g7 mutants on yeast. We have used a 
biotinylated peptide called B- 1040-63 (Judkowski et al., 2001) for this purpose. This biotinylated 
peptide was shown to bind specifically to L cells that were transfected with IA g7 gene, but it does 
not bind to untransfected L cells (Fig. 13). This biotinylated peptide is used to select peptide- 
binding mutants from the yeast library that expresses stabilized IA g7 mutants. Such stabilized 
peptide-IA g7 complexes are then used to confirm their ability to regulate T cell activity. 
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The second goal has been to explore whether mutants of even smaller class II peptide 
binding modules can be produced by expressing only the N-terminal (a t and ft) domains of the 
IA 87 molecule (Chang et al., 2001). The single-chain constructions shown in Fig. 14 have now 
been generated. Yeast libraries with mutated versions of these proteins are selected for anti-IA g7 
and peptide binding in order to isolate even smaller agents that can be used to regulate T cells. 

Monoclonal or polyclonal antibodies, preferably monoclonal, specifically reacting with an 
MHC protein at a site other than the ligand binding site may be made by methods known in the 
art. See, e.g., Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor 
Laboratories; Goding(1986) Monoclonal Antibodies: Principles and Practice, 2d ed., Academic 
Press, New York; and Ausubel et al. (1999) Current Protocols in Molecular Biology, John Wiley 
& Sons, Inc., New York. 

Stable Class II MHC proteins in cell-bound or soluble form which are bound to a 
particular peptide as a complex are useful, for example, as diagnostic probes for screening 
biological samples (such as cells, tissue samples, biopsy material, bodily fluids and the like) for 
the presence of T cells displaying a T cell receptor protein specific for the peptide-MHC complex. 
In addition, they can serve as antagonists of T lymphocyte-mediated destruction of cells or tissue 
expressing the particular peptide. Frequently, the stable Class II MHC proteins are labeled by 
joining, either covalently or noncovalently, a substance which provides a detectable signal. 
Suitable labels include but are not limited to radionuclides, enzymes, substrates, cofactors, 
inhibitors, fluorescent agents, chemiluminescent agents, magnetic particles and the like. 
Additionally the MHC protein can be coupled to a ligand for a second binding molecules: for 
example, the MHC protein can be biotinylated. United States Patents describing the use of labels 
and/or toxic compounds to be covalently bound to the Class II MHC stabilized protein or 
complex include, but are not limited, to Nos. 3,817,837; 3,850,752; 3,927,193; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; 4,331,647; 4,348,376; 4,361,544; 4,468,457; 4,444,744; 
4,640,561;4,366,241;RE35,500; 5,299,253; 5,101,827; 5,059,413. Labeled (detectable) MHC 
Class II proteins can be detected using a monitoring device or method appropriate to the label 
used. Fluorescence microscopy or fluorescence activated cell sorting can be used where the label 
is a fluorescent moiety, and where the label is a radionuclide, gamma counting, autoradiography 
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or liquid scintillation counting, for example, can be used with the proviso that the method is 
appropriate to the sample being analyzed and the radionuclide used. In addition, there can be 
secondary detection molecules or particle employed where there is a detectable molecule or 
particle that recognized the proteins. The art knows useful compounds for diagnostic imaging in 
situ; see, e.g., U.S. Patent No. 5,101,827 or 5,059,413. Radionuclides useful for therapy and/or 
imaging in vivo include m Indium, 97 Rubidium, 125 Iodine, 131 Iodine, 123 Iodine, 67 Gallium, 
"Technetium. Toxins include diphtheria toxin, ricin and castor bean toxin, among others, with 
the proviso that once the MHC-toxin complex is bound to the cell, the toxic moiety is internalized 
so that it can exert its cytotoxic effect. Immunotoxin technology is well known to the art, and 
suitable toxic molecules include, without limitation, chemotherapeutic drugs such as vindesine, 
antifolates, e.g. methotrexate, cisplatin, mitomycin, anthrocyclines such as daunomycin, 
daunorubicin or adriamycin, and cytotoxic proteins such as ribosome inactivating proteins (e.g., 
diphtheria toxin, pokeweed antiviral protein, abrin, ricin, pseudomonas exotoxin A or their 
recombinant derivatives. See, generally, e.g., 01snesandPihl(1982)P/?arwac. Ther. 25:355-381 
and Monoclonal Antibodies for Cancer Detection and Therapy, Eds. Baldwin and Byers, pp. 
159-179, Academic Press, 1985. 

Stable, high affinity MHC proteins specific for a particular ligand, e.g., a particular 
peptide, protein or cell type, are useful in diagnosing animals, including humans, believed to be 
suffering from a disease associated with the particular pMHC. The MHC molecules of the present 
invention are useful for detecting T cells that are specific for essentially any antigen including, but 
not limited to, those associated with a neoplastic condition, an abnormal protein, or an infection 
or infestation with a bacterium, a fungus, a virus, a protozoan, a yeast, a nematode or other 
parasite. The proteins can also be used in the diagnosis of certain genetic disorders in which there 
is a stabilized MHC Class II abnormal protein produced. Exemplary applications for these stable, 
high affinity proteins is in the treatment of autoimmune diseases in which there is a known pMHC. 
Type I diabetes is relatively well characterized with respect to the autoantigens which attract 
immune destruction. Multiple sclerosis, celiac disease, inflammatory bowel disease, Crohn's 
disease and rheumatoid arthritis are additional candidate diseases for such application. Stabilized 
Class II MHC proteins with binding specificity for a particular peptide on the surface of cells or 
tissues which are improperly targeted for autoimmune destruction can serve as antagonists of the 
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autoimmune destruction by competing for binding to the target cells by T cells or by directly 
inactivating the T cell. Such stabilized MHC proteins can be obtained by the methods described 
herein and subsequently used for screening for T cells that are specific for a particular ligand of 
interest. 

The stabilized MHC compositions, especially as the stable mutant peptide-MHC chimeric 
protein or protein complex, can be formulated by any of the means known in the art. They can 
be typically prepared as injectables, especially for intravenous, intraperitoneal or synovial 
administration (with the route determined by the particular disease) or as formulations for 
intranasal or oral administration, either as liquid solutions or suspensions. Solid forms suitable 
for solution in, or suspension in, liquid prior to injection or other administration may also be 
prepared. The preparation may also, for example, be emulsified, or the protein(s)/peptide(s) 
encapsulated in liposomes. 

The active ingredients are often mixed with excipients or carriers which are 
pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients include 
but are not limited to water, saline, dextrose, glycerol, ethanol, or the like and combinations 
thereof. The concentration of the MHC protein in injectable, aerosol or nasal formulations is 
usually in the range of 0.05 to 5 mg/ml. Similar dosages can be administered to other mucosal 
surfaces. 

In addition, if desired, vaccines may contain minor amounts of auxiliary substances such 
as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 
effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP); N- 
acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP); N- 

acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine^^l'^'-dipalmitoyl-sn-glycero- 
3hydroxyphosphoryloxy)-ethylamine (CGP 19835 A, referred to as MTP-PE); and RIB I, which 
contains three components extracted from bacteria: monophosphoryl lipid A, trehalose dimycolate 
and cell wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. Such additional 
formulations and modes of administration as are known in the art may also be used. 
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The stabilized high affinity MHC Class II proteins of the present invention and/or pMHC- 
binding fragments having primary structure similar (more than 90% identity) to the high affinity 
MHC proteins and which maintain the improved stability and/or the high affinity for the cognate 
ligand may be formulated into vaccines as neutral or salt forms. Pharmaceutically acceptable salts 
include but are not limited to the acid addition salts (formed with free amino groups of the 
peptide) which are formed with inorganic acids, e.g., hydrochloric acid or phosphoric acids; and 
organic acids, e.g., acetic, oxalic, tartaric, or maleic acid. Salts formed with the free carboxyl 
groups may also be derived from inorganic bases, e.g., sodium, potassium, ammonium, calcium, 
or ferric hydroxides, and organic bases, e.g., isopropylamine, trimethylamine, 2-ethylamino- 
ethanol, histidine, and procaine. Alternatively, these high affinity MHC proteins can be used as 
antagonists of an interaction between endogenous MHC proteins of similar specificity and the 
cognate TCR cells. 

MHC proteins for therapeutic use, e.g., those conjugated to cytotoxic compounds are 
administered in a manner compatible with the dosage formulation, and in such amount and manner 
as are prophylactically and/or therapeutically effective, according to what is known to the art. 
The quantity to be administered, which is generally in the range of about 100 to 20,000 ug of 
protein per dose, more generally in the range of about 1000 to 10,000 ug of protein per dose. 
Similar compositions can be administered in similar ways using labeled high affinity MHC proteins 
for use in imaging, for example, to detect T cells that are involved in an autoimmune attack and 
express the TCRs that are specific for the tissue that is the target of the autoimmune attack. 
Precise amounts of the active ingredient required to be administered may depend on the judgment 
of the physician or veterinarian and may be peculiar to each individual, but such a determination 
is well within the skill of such a practitioner. 

The vaccine or other immunogenic composition may be given in a single dose; two dose 
schedule, for example two to eight weeks apart; or a multiple dose schedule. A multiple dose 
schedule is one in which a primary course of vaccination may include 1 to 10 or more separate 
doses, followed by other doses administered at subsequent time intervals as required to maintain 
and/or reinforce the immune response, e.g., at 1 to 4 months for a second dose, and if needed, a 
subsequent dose(s) after several months. Humans (or other animals) immunized with the 
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retrovirus-like particles of the present invention are protected from infection by the cognate 
retrovirus. 

Standard techniques for cloning, DNA isolation, amplification and purification, for 
enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and the 
like, and various separation techniques are those known and commonly employed by those skilled 
in the art. A number of standard techniques are described in Sambrook et al. (1989) Molecular 
Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview, New York; Maniatis et al. 
(1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New York; Wu (ed.) 
(1993) Meth. Enzymol 218, Part I; Wu (ed.) (1919) Meth Enzymol. 68; Wu et al. (eds.) (1983) 
Meth. Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth. Enzymol. 65; Miller (ed.) 
(1972) Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, Cold SpringHarbor, 
New York; Old and Primrose (1981) Principles of Gene Manipulation, University of California 
Press, Berkeley; Schleif and Wensink (1982) Practical Methods in Molecular Biology; Glover 
(ed.) (1985) DNA Cloning Vol. I and II, IRL Press, Oxford, UK; Hames and Higgins (eds.) 
(1985) Nucleic Acid Hybridization, IRL Press, Oxford, UK; and Setlow and Hollaender (1979) 
Genetic Engineering: Principles and Methods, Vols. 1-4, Plenum Press, New York. 
Abbreviations and nomenclature, where employed, are deemed standard in the field and commonly 
used in professional journals such as those cited herein. 

All references cited in the present application are incorporated by reference herein to 
supplement the disclosure and experimental procedures provided in the present Specification to 
the extent that there is no inconsistency with the present disclosure. 

The following examples are provided for illustrative purposes, and are not intended to 
limit the scope of the invention as claimed herein. Any variations in the exemplified articles and/or 
methods which occur to the skilled artisan are intended to fall within the scope of the present 
invention. 
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EXAMPLES 

Example 1 . Engineering of single-chain class II MHC, IA g7 . 

To construct a recombinant vector for the expression of a single chain Class II MHC 
protein, the IA g7 P-chain was PCR amplified using the forward primer (5'-ATT GCA GCT AGC 
GGT GGA CCT AAG GGT GGC GGC GGT TCT TT A GTT CCA AGA GGT TCT GGT GGC 
GGT GGC TCT GGA GAC TCC GAA AGG CAT TT-3', SEQ ID NO:2) incorporating 
NhellAflR restriction sites and 16 AA linker upstream of the P-chain and reverse primer (5'-TCC 
GCC ACC TCC AGA ACC TCC TCC GCC CCT CCA CTC CAC AGT GAT GGG-3', SEQ 
ID N0:3) containing the 9 AA linker downstream of the P-chain. The IA 8? a-chain was amplified 
with the forward primer (5'-GGC GGA GGA GGT TCT GGA GGT GGC GGA GAA GAC 
GAC ATT GAG GCC-3', SEQ ID N0.28) that contained the same 9 amino acid linker upstream 
of the a-chain and reverse primer (5'- ATT TGC AGA TCT TTA TCA CAA GTC TTC TTC 
AGA AAT AAG CTT TTG TTC CCA GTG TTT CAG AAC CGG CTC-3' SEQ ID N0:4) 
incorporating the c-myc epitope tag and Bglil diagnostic site downstream of the a-chain. PCR 
sewing was then used to fuse LA 87 P-chain and a-chain PCR products through and additional 
amplification using the P-chain forward primer and the a-chain reverse primer. 

The GAD65(78-96) scIA g7 construct was generated by PCR amplification of the scIA g7 
fusion product using a forward primer (5'-ATT GCA GCT AGC AAA CCA TGT AAT TGT 
CCA AAA GGT GAT GTT AAT TAT GCT TTT TTG CAT GCT ACT GAT CTT AAG GGT 
GGC GGC GGT TCT TTA GTT CCA-3', SEQ ID NO:5) that incorporated the GAD65(78-96) 
peptide between the Nhel and AflR restriction sites and the a-chain reverse primer. The 
generated scIA g7 and GAD65(78-96) scIA g7 constructs were digested with Nhel and BglU. and 
ligated to iWiel-i?g/II-digested yeast surface display vector pCT302 containing a nine-residue 
epitope tag (HA) and the AGA2 open reading frame downstream of the inducible GAL1 promoter 
(Boder and Wittrup, 1997). The ligation mixture was transformed into electro-competent E. coli 
DH10B (Gibco BRL/Invitrogen, Carlsbad, CA), and transformants were plated on LB plates 
supplemented with ampicillin at 100 |ig/ml and grown overnight at 37°C. 

Cassette ligations were used to generate B9-23 scIA g7 and BDC2.5(A) scIA 87 constructs. 
BDC2.5(A) peptide and B9-23insulin peptide sense (5'-CTA GCG GTA AAA AGG TTG CTG 
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CAC CAG CTT GGG CTC GTA TGG GTC-3', SEQ ID NO:6; 5'-CTA GCT CTC ATT TGG 
TTG AAG CTT TGT ATT TGG TTT GTG GTG AAA GAG GTC-3', SEQ ID N0:7) and anti- 
sense (5'-TTA AGACCC ATA CGA GCC CAA GCT GGT GCA GCA ACC TTT TTACCG-3', 
SEQ ID NO : 8; 5 '-TT A AGA CCT CTT TC A CCA CAA ACC AAA T AC AAA GCT TC A ACC 
5 AAA TGA GAG-3', SEQ ID NO:9) 5'-phosphorylated oligonucleotides with Nhel and AJTU. 
restriction site overhangs were mixed in equimolar ratios. Peptide-specific forward and reverse 
primers were incubated at 100°C for 1 minute, and allowed anneal at 25 °C, generating the 
peptide cassettes. The cassettes were then ligated to iW?el-4/?II-digested GAD65(78-96) 
scIA g7 /pCT302 and transformed into E. coli as described previously. Plasmid DNA was 
10 transformed into the yeast strain EBY100 according to published methods (Geitz et al., 1995). 
Transformants are selected for tryptophan prototrophy. 

Q Example 2. Flow Cytometric Analysis 

2. Figure 3 shows the results of flow cytometric analyses of various scIA g7 constructions. 

W The scIA 87 constructs were grown in SD-CAA (2% dextrose, 0.67% yeast nitrogen base, 1% 

W Casamino acids (CAA Difco, Detroit, MI)) at 30°C for 18-20 h. To induce surface scIA 8? 

y = 

5 expression, yeast cells were pelleted by centrifugation, resuspended to an OD 600 of about 1 .0 in 
S G-C AA (2% galactose, 0. 67% yeast nitrogen base, 1 % Casamino acids), and incubated at 20°C. 

tl After ~18-20 h of incubation, cultures were harvested, approximately 10 7 cells/tube were 

2D incubated on ice, washed with PBS (10 mM NaP0 4 , 150 mM NaCl, pH 7.3) containing 0.5% 
bovine serum albumin (BSA), and incubated for 1 hr with 25 uL of 10 (ig/mL anti-HA mAb 
12CA5 (Boehringer Mannheim, Indianapolis, IN), anti-c-myc mAb 9E10 (1:50 dilution of raw 
ascites fluid; Berkeley Antibody Co., Richmond, CA), or 10 u,g/ml anti- IA g7 mAb 1 0.216 purified 
from hybridoma supernatant. Cells were washed with PBS and incubated for 1 h on ice with 

25 FITC-labeled F(ab') 2 goat anti-mouse IgG (1:50) (Kirkegaard and Perry Labs, Inc. , Gaithersburg, 
MD). Labeled cells were washed with PBS and analyzed on a Coulter Epics XL flow cytometer. 
As observed in the flow histograms, the Aga-2-IA g7 fusion in single chain format (Aga-2-HA-P- 
chain-linker-a-chain-c-/wyc) was not detectable on the yeast, which is consistent with the 
instability of the class II product. The Aga-2-IA g7 fusion with 3 peptides (GAD65[78-96], B9- 

30 23 [Insulin], BDC2.5 [alanine stabilized variant]) linked at the IA g7 amino terminus of the P-chain 
exhibited low or undetectable levels of expression. 
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L-cells transfected with I-A g7 were tested for the direct binding of the biotinylated-I-A g7 
specific peptide 1040-63 (B-l-4-063, Biotin-RTRPLWVRME (RTRPLWVRME, SEQ ID 
NO:27). Transfected and non-transfected L-cells were grown until confluent and harvested. L- 
cells (~2x 1 0 5 cells/aliquot) were incubated with varying concentrations of the B- 1 040-63 peptide 
for 3 hours at 37°C in PBS/0.5% BSA, pH 6. Aliquots of cells were washed three times with 
PB S/0. 5% B S A, pH 7, and incubated for one hour with streptavidin-phycoerythrin (PE) conjugate 
(1 :200 dilution; PharMingen, San Diego, CA). L-cells were then washed three times in PBS/0.5% 
BSA, pH 7, and analyzed by fluorescence activated cell sorting using a Coulter Epics XL flow 
cytometer as described herein. Fig. 13 is a binding curve showing I-A g7 -positive L-cell binding 
to titrated amounts of B-l 040-63 peptide (1.6 uM-50 uM). At the peptide concentrations tested, 
non-specific binding of B-l 04-63 to I-A g7 -negative L-cells does not occur. 

Example 3. Mutagenesis of Class II MHC Fusion Proteins. 

The production of mutagenic libraries in order to generate stabilized MHC Class II, A 8? 
has been described. The random mutagenesis strategy employed the use of error-prone PCR and 
yeast homologous recombination to generate mutagenic libraries. The scIA 87 constructs were 
amplified using the flanking AGA-2-specific upstream primer 1 (splice 4/L, 5'-GGC AGC CCC 
ATA AAC AC A CAG TAT-3 SEQ ID NO: 1 0) and downstream primer 4 (T7 Promoter, 5'-TAA 
TAC GAC TCA CTA TAG GG-3', SEQ ID NO: 1 1) with an additional ~ 100 bp upstream and 
-300 bp downstream extending into the display vector (pcT302). Random nucleotide errors were 
incorporated into scIA g7 constructs using Taq polymerase (Gibco BRL/Invitrogen, Carlsbad, CA) 
in the presence of 2 mM MgCl 2 and 0.3 mM MnCl 2 . The directed mutagenesis strategy utilized 
PCR sewing and yeast homologous recombination to mutate the p56 and 057 residues of scIA 87 
p-chain. This process involved: a PCR sewing step with primer 3 (5'-TAC CGC GCG GTG ACC 
GAG CTC GGG CGG NNS NNS GCC GAG TAC TAC AAT AAG C-3', SEQ ID NO: 12) 
degenerate (where N is any nucleotide and S is C or G) at each position to be varied; reverse 
primer 2 (5'-CCG CCC GAG CTC GGT CAC CGC GCG GTA CTC GCC CAC GTC G-3', SEQ 
ID NO: 13) complementary to the 18 bases at the 5' end of this primer; and primer 1 (splice 4/L) 
and primer 4 (T7 promoter) flanking primers that amplify the entire scIA 87 construct. The 
underlined bases in primer 3 and primer 2 indicate the position of a silent mutation introducing 
a Sad restriction site into the construct. Approximately 150 ng of linear random or directed 
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mutagenic PCR product and ~1 50 ng of Nhe l-5g/II-digested 7M sc4Fl 0 pCT302 were combined 
(per transformation) and transformed into S. cerevisae EBY100 yeast by electroporation to 
generate libraries, with selection for growth in the absence of tryptophan. Transformants were 
pooled into ~250-ml SD-CAA and grown -48 hr at 30° C. Isolated clones from each mutagenic 
5 strategy were rescued and sequenced to verify mutagenesis. Approximately 4-7 nucleotide errors 
were incorporated per 1000 base pairs in the random scIA g7 libraries. 

Example 4. Production of Mutant Libraries of scMHC Class II Protein 

In order to generate stabilized MHC Class II, A g7 , the random mutagenesis strategy 
10 summarized in Fig. 4 was employed. This strategy used error-prone PCR and yeast homologous 
recombination to generate mutagenic libraries. The scIA 87 constructs were amplified using the 
flanking AGA2-specific upstream primer 1 (splice 4/L, 5'-GGC AGCCCC ATA AAC ACACAG 
p TAT-3', SEQ ID NO: 14) and downstream primer 4 (T7 Promoter, 5'-TAA TAC GAC TCA CTA 
5 TAG GG-3', SEQ ID NO: 15) with an additional ~ 100 bp upstream and -300 bp down stream 
1:5; extending into the display vector (pCT302). Random nucleotide errors were incorporated into 

W scIA g7 constructs using Taq polymerase (Gibco BRL/Invitrogen, Carlsbad, CA) in the presence 

in 

g of 2 raM MgCl 2 and 0.3 mM MnCl 2 . The directed mutagenesis strategy utilized PCR sewing and 

=H yeast homologous recombination to mutate the £56 and P57 residues of scIA 87 P-chain. This 
N; process involved: a PCR sewing step with primer 3 (5'-TAC CGC GCG GTG ACC GAG CTC 
2J GGG CGGNNS NNS GCC GAG TAC TAC AAT AAG C-3', SEQ ID NO: 12) degenerate (N 
is any nucleotide and S is C or G) at each position to be varied; reverse primer 2 (5'-CCG CCC 
GAG CTC GGT CAC CGC GCG GTA CTC GCC CAC GTC G-3', SEQ ID NO: 13) 
complementary to the 18 bases at the 5' end of this primer; and primer 1 (splice 4/L) and primer 
4 (T7 promoter) flanking primers that amplify the entire scIA g7 p56(357 construct. Underlined 
25 bases in primer 3 and primer 2 indicate the position of a silent mutation introducing a Sad 
restriction site into the construct. Approximately 150 ng random or directed mutagenic PCR 
product and -150 ng of A%el-Z?g/II-digested 7M sc4F10 pCT302 were combined (per 
transformation) and transformed into S. cerevisiae EBY100 yeast by electroporation to generate 
libraries. Transformants were pooled into ~250-ml SD-CAA and grown -48 hr at 30°C. Isolated 
30 clones from each mutagenic strategy were rescued and sequenced to verify mutagenesis. 
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Approximately 4-7 nucleotide errors were incorporated per 1000 base pairs in the random scIA g7 
libraries. 

Yeast displaying the GAD65scIA g7 WT and GAD65 scIA g7 f35657Mut2 fusion proteins 
were analyzed by flow cytometry. GAD65scIA g7 WT/yeast and GAD65 scIA g7 f35657Mut2/yeast 
cells were stained with anti-IA g7 mAb 10.216 and anti-c-myc mAb 9E10 followed by FITC-labeled 
F(ab') 2 goat anti-mouse IgG. Labeled cells were analyzed on a Coulter Epics XL flow cytometer. 
The resultant GAD65scIA g7 WT (unshaded) and GAD65 scIA g7 p5657Mut2 (shaded) histograms 
are shown in Figs. 10A-10B. A positive population shift of the GAD65 scIA g7 J35657Mut2/yeast 
was observed when compared to the GAD65scIA g7 WT /yeast indicating an increased surface 
levels (i.e. increased stability). GAD65 scIA g7 J35657Mut2 contained HP56E and SP57V 
mutations, as determined by sequence analysis. 

Example 5. Flow Cytometric Analysis of Mutant Libraries. 

Yeast displaying the B9-23scIA g7 wild-type(WT) and B9-23scIA g7 Mut8 Aga-2 fusions 
were analyzed by flow cytometry. B9-23scIA g7 WT/yeast and B 9-23 scIA g7 Mut8/y east were 
stained with anti-IA g7 mAb 10.216 and anti-c-myc mAb 9E10 followed by FITC-labeled F(ab') 2 
goat antirmouse IgG. Labeled cells were analyzed on a Coulter Epics XL flow cytometer. The 
resultant B9-23scIA g7 WT (unshaded) and B9-23scIA g7 Mut8 (shaded) histograms are shown in 
Fig. 8. A positive population shift of the B9-23scIA g7 Mut8/yeast was observed when compared 
to the B9-23scIA g7 WT/yeast indicating increased surface levels (i.e. increased stability). A 
negative population has been observed for all yeast-displayed proteins. Without wishing to be 
bound by any particular theory, we believe that this is caused by yeast at a stage of growth or 
induction incapable of expressing the surface fusion protein. B9-23scIA g7 Mut8 was determined 
to have the following mutations: G013A, S357L, Wa43S, and Vccl39D. 

Example 6. Analysis of Clones Isolated by Sorting. 

The mutant clones isolated by cell sorting from the GAD65(78-96) scIA g7 and the B9-23 
scIA g7 error-prone PCR libraries were further analyzed. Binding levels are shown as % positive 
population shift to mti-c-myc mAb and anti- IA g7 mAb for GAD65(78-96) scIA g7 and B9-23 
scIA g7 clones isolated. Six mutants in addition to GAD65(78-96) scIA g7 WT/yeast, B9-23 
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scIA g7 WT/yeast, 7Msc4F10/yeast (mti-c-myc mAb positive control), and S. cerevisiae EBY100 
(negative control) were induced in galactose medium overnight at 30°C. Cells were analyzed by 
flow cytometry after staining with anti-c-zwyc followed by FITC-labeled F(ab') 2 goat anti-mouse 
IgG (more shaded bars), or stained with anti- IA g7 mAb followed by FITC-labeled F(ab') 2 goat 
anti-mouse IgG (less shaded bars). Mutants isolated yielded higher surface level from the FACS 
procedure binding to mti-c-myc and anti- IA g7 antibodies than wild-type counterparts (except for 
GAD65Mutl4). This correlates with increased protein stability. GAD65Mutl4 has previously 
shown higher surface display than GAD65WT. Low expression levels in this study could possibly 
be attributed to experimental error. 

Example 7. Rapid (one day) Sequential Sorting of BDC2.5. 

The BDC2.5 scIA g7 05657 yeast library was stained with 12.5 ul anti-IA g7 mAb 10.216 
(10 ug/mi) and 12.5 ul biotin-Iabeled anti-c-myc mAb 9E10 (1:100) (Berkeley Antibody Co., 
Richmond, CA), washed with buffer (PBS/0.5% BSA), and incubated with 12.5 ul FITC-labeled 
F(ab') 2 goat anti-mouse, y 2 b chain specific, IgG 2b (1 :50) (SouthernBiotechnology Associates, Inc., 
Birmingham, AL) and streptavidin-phycoerythrin (SA-PE) conjugate (1:100) (PharMingen, San 
Diego, CA). After washing, samples were sorted in purification mode (coincident negative cells 
rejected) on a Cytomation MoFlo sorter (Cytomation, Fort Collins, CO). A total of 2 x 10 7 cells 
were examined during the first sorting round, collecting ~ 1% of the population. The collected 
cells were sequentially sorted twice more on the same day collecting the top ~ 1% of the 
population. The cells collected from the third sort were plated onto selective glucose plates and 
grown for ~48 h. Isolated clones were further examined by flow cytometry. See Fig. 1 1 . 

Example 8. Summary of Clones Isolated by Sorting from BDC2.5 scIA g7 p5657 library. 

Binding levels are shown as a % positive population shift to mti-c-myc mAb and anti- IA g7 
mAb from BDC2.5 scIA g7 f$5657 clones isolated from the final sequential sort. Nine mutants in 
addition to BDC2.5 scIA g7 WT/yeast, B9-23 scIA g7 Mut8/yeast (anti- IA g7 mAb positive control), 
7Msc4F10/yeast (mti-c-myc mAb positive control), and EBY100 (negative control) yeast were 
induced in galactose medium overnight at 30°C. Cells were analyzed by flow cytometry after 
staining with anti-c-myc mAb (stippled bars), or stained with anti- IA g7 mAb (shaded bars) 
followed by FITC-labeled F(ab') 2 goat anti-mouse IgG. Mutants isolated yielded higher surface 
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level binding to anti-c-myc and anti- IA g7 antibodies than its BDC2.5 scIA g7 WT counterpart. 
BDC2.5 scIA g7 J35657 mutants were sequenced and contained consensus motifs ofE/G S6 and 
L/M 57 . 

Example 9. Engineering of single-chain I-A g7 p^ MHC class II fusion proteins. 

To generate scI-A g7 p^ fusions (Aga-2-p r domain-a 1 -domain-c-m^c), the scI-A g7 
constructs (Fig. 2) were amplified through a process termed PCR sewing as described herein. 
Briefly, an Aga-2 specific upstream primer (splice 4/L, SEQ ID NO: 1 4) and a downstream I-A g7 pi 
domain specific primer (B 1 Al reverse-sew, 5'-T AC GTG GTC GGC CTC AAT GTC GTC TTC 
AAG CCG CCG CAG GGA GGT GGG GAC CTC-3*, SEQ ID NO:25) were used to amplify the 
5' end and p x domains of the scI-A g7 constructs. An I-A 87 ^ domain specific primer (B1A1 
forward-sew, 5'-GAG GTC CCC ACC TCC CTG CGG CGG CTT GAA GAC ATT GAG GCC 
GAC CAC GTA-3', SEQ ID NO: 27), containing 18 bases of the 5' end complementary to the 
previous primer, and a primer coding for the 3' end of the a x domain and z-myc epitope tag (Al- 
reverse-c-myostop, 5'-CAA TAG AGA TCT TTA TCA CAA GTC TTC TTC AGA AAT AAG 
CTT TTG TTC ATT GGT AGC TGG GGT GAA ATT TGA CCT C-3', SEQ ID NO:26) were 
used to amplify the entire a t domain. The resultant PCR products were mixed and PCR amplified 
using Splice 4/L and Al-reverse-c-myc-stop flanking primers to generate the full-length fusion 
constructs. The scI-A g7 PjOij fusion PCR-products were digested with Nhel and Bglll and ligated 
into the Nhel and Bglll-digested yeast surface display vector, pCT302. The ligation mix was 
transformed into E. coli DH10B, and rescued plasmid DNA was then used to transform S. 
cerevisiae strain EBY100 as described herein. Shown are schematics of several scI-A g7 p^ 
fusions that were generated, including the various peptide sequences incorporated at 5' end of the 
48 base-pair linker. See Fig. 14. 
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Table 1. 
Results of Mutagenic Libraries 



Random Mutagenesis 

scIA e7 

BDC2.5(A)scIA g7 
B9-23 scIA 87 
GAD65(78-96 scIA g7 

BDC2.5(A) scIA* 7 #1 

BDC2.5(A) scIA 87 #2 

Directed Mutagenesis (P56/57) 

GAD65(78-96 scIA 87 (35657 
BDC2.5(A) scIA 87 05657 



# Transformants 

1.7 x 10 6 

1.4 x 10 6 
1.2 x 10 6 
1.2 x 10 6 

3.0 x 10 5 

3.7 x 10 6 



4.3 x 10 4 
1.0 x 10 5 
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Table 2. 

Nucleotide and Amino Acid Sequences 

Nucleotide and amino acid sequences for the scl- A g7 construct (a-chain-linker-p-chain-c-wyc) and 
5 3 peptides (GAD65 [78-96], B9-23 [Insulin], BDC2.5 [alanine stabilized variant]). 

scI-A g7 construct (a-chain-Iinker-a-chain-c-wyc) 

nucleotide sequence (SEQ ID NO: 16) 

GCTAGCGGTGGACTTAAGGGTGGCGGCGGTTCTTTAGTTCCAAGAGGTTCTGGTGGCGGTGG 
1 0 CTCTGGAGACTCCGAAAGGCATTTCGTGCACCAGTTCAAGGGCGAGTGCTACTTCACCAACG 
GGACGCAGCGCATACGGCTCGTGACCAGATACATCTACAACCGGGAGGAGTACCTGCGCTTC 
GACAGCGACGTGGGCGAGTACCGCGCGGTGACCGAGCTGGGGCGGCACTCAGCCGAGTACTA 
CAATAAGCAGTACCTGGAGCGAACGCGGGCCGAGCTGGACACGGCGTGCAGACACAACTACG 
L; . AGGAGACGGAGGTCCCCACCTCCCTGCGGCGGCTTGAACAGCCCAATGTCGCCATCTCCCTG 
IS TCCAGGACAGAGGCCCTCAACCACCACAACACTCTGGTCTGTTCGGTGACAGATTTCTACCC 
ff AGCCAAGATCAAAGTGCGCTGGTTCAGGAATGGCCAGGAGGAGAAGTGGGGGTCTCATCCAC 
Lfi ACAGCTTATTAGGAATGGGGACTGGACCTTCCAGGTCCTGGTCATGCTGGAGATGACCCCTC 
'fl ATCAGGGAGAGGTCTACACCTGCCATGTGGAGCATCCCAGCCTGAAGAGCCCCATCACTGTG 
if! GAGTGGAGGGGCGGAGGAGGTTCTGGAGGTGGCGGAGAAGACGACATTGAGGCCGACCACGT 
20 AGGCTTCTATGTACAACTGTTTATCAGTCTCCTGGAGACATTGGCCAGTACACACATGAATT 
TGATGGTGATGAGTTGTTCTATGTGGACTTGGATAAGAAGAAAACTGTCTGGAGGCTTCCTG 

s y 

AGTTTGGCCAATTGATACTCTTTGAGCCCCAAGGTGGACTGCAAAACATAGCTGCAGAAAAA 
2 CACAACTTGGGAATCTTGACTAAGAGGTCAAATTTCACCCCAGCTACCAATGAGGCTCCTCA 
hi AGCGACTGTGTTCCCCAAGTCCCCTGTGCTGCTGGGTCAGCCCAACACCCTTATCTGCTTTG 
25 TGGACAACATCTTCCCACCTGTGATCAACATCACATGGCTCAGAAATAGCAAGTCAGTCACA 
GACGGCGTTTATGAGACCAGCTTCCTCGTCAACCGTGACCATTCCTTCCACAAGCTGTCTTA 
TCTCACCTTCATCCCTTCTGATGATGACATTTATGACTGCAAGGTGGAGCACTGGGGCCTGG 
AGGAGCCGGTTCTGAAACACTGGGAACAAAAGCTTATTTCTGAAGAAGACTTGTGATAAAGATCT 

30 amino acid sequence (SEQ ID NO: 17) 

ASGGLKGGGGSLVPRGSGGGGSGDSERHFVHQFKGECYFTNGTQRIRLVTRYIYNREEYLRF 
DSDVGEYRAVTELGRHSAEYYNKQYLERTRAELDTACRHNYEETEVPTSLRRLEQPNVAISL 
SRTEALNHHNTLVCSVTDFYPAKIKVRWFRNGQEETVGVSSTQLIRNGDWTFQVLVMLEMTP 
HQGEVYTCHVEHPSLKSPITVEWRGGGGSGGGGEDDIEADHVGFYGTTVYQSPGDIGQYTHE 

35 FDGDELFYVDLDKKKTVWRLPEFGQLILFEPQGGLQNIAAEKHNLGILTKRSNFTPATNEAP 
QATVFPKSPVLLGQPNTLICFVDNIFPPVINITWLRNSKSVTDGVYETSFLVNRDHSFHKLS 
YLTFIPSDDDIYDCKVEHWGLEEPVLKHWEQKLISEEDL 
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BDC2.5(A) Peptide 

CODING sequence (SEQ ID NO: 18) 

GGTAAAAAGGTTGCTGCACCAGCTTGGGCTCGTATGGGT 

amino acid sequence (SEQ ID NO: 19) 
GKKVAAPAWARMG 

GAD65(78-96) Peptide 

coding sequence (SEQ ID NO:20) 

AAACCATGTAATTGTCCAAAAGGTGATGTTAATTATGCTTTTTTGCATGCTACTGAT 

amino acid sequence (SEQ ID NO: 22) 
KPCNCPKGDVNYAFLHATD 

B9-23 Insulin Peptide 

coding sequence (SEQ ID NO:23) 

TCTCATTTGGTTGAAGCTTTGTATTTGGTTTGTGGTGAAAGAGGT 

amino acid sequence (SEQ ID NO:24) 
SHLVEALYLVCGERG 
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